Proteome Profiling Outperforms Transcriptome Profiling for Coexpression Based Gene Function Prediction*
نویسندگان
چکیده
Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this "guilt-by-association" (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies.
منابع مشابه
Identification of coexpressed gene clusters in a comparative analysis of transcriptome and proteome in mouse tissues.
A major advantage of the mouse model lies in the increasing information on its genome, transcriptome, and proteome, as well as in the availability of a fast growing number of targeted and induced mutant alleles. However, data from comparative transcriptome and proteome analyses in this model organism are very limited. We use DNA chip-based RNA expression profiling and 2D gel electrophoresis, co...
متن کاملSystematic enrichment analysis of microRNA expression profiling studies in endometriosis
Objective(s): The purpose of this study was to conduct a meta-analysis on human microRNAs (miRNAs) expression data of endometriosis tissue profiles versus those of normal controls and to identify novel putative diagnostic markers. Materials andMethods: PubMed, Embase, Web of Science, Ovid Medline were used to search for endometriosis miRNA expression profiling studies of endometriosis. The miRN...
متن کاملDataset of target mass spectromic proteome profiling for human chromosome 18
Proteome profiling is a type of quantitative analysis that reveals level of protein expression in the sample. Proteome profiling by using selected reaction monitoring is an approach for the Chromosome-centric Human Proteome Project (C-HPP). Here we describe dataset generated in the course of the pilot phase of Russian part of C-HPP, which was focused on human Chr 18 proteins. Proteome profiling...
متن کاملUtilizing RNA-Seq data for de novo coexpression network inference
MOTIVATION RNA-Seq experiments have shown great potential for transcriptome profiling. While sequencing increases the level of biological detail, integrative data analysis is also important. One avenue is the construction of coexpression networks. Because the capacity of RNA-Seq data for network construction has not been previously evaluated, we constructed a coexpression network using striatal...
متن کاملThe significance of gene profiling in diagnosis the cause of drug resistance in cancer
Chemoresistance is one of the main obstacles to the success of cancer treatment and one of the most important causes of death in patients. In the last decade, progress in high-throughput technologies, including microarray, sequencing, and bioinformatics has greatly resulted in cancer gene profiling and identification of biomarkers for cancer prognosis and prediction. This has greatly improved t...
متن کامل